RAW SEQUENCE LISTING 



ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 10/030,678

Source: PCT10
Date Processed by STIC: 08/05/2002

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216.
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission
User Manual - ePAVE)

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202

3. Hand Carry directly to:

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name,
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202
Or

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202



Revised 01/29/2002 






Raw Sequence Listing Error Summary 



ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
    was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
    prevent "wrapping."



Invalid Line Length
    The rules require that a line not exceed 72 characters in length. This includes white spaces.



Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
    use space characters instead.

Non-ASCII
    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
    ensure your subsequent submission is saved in ASCII text.

Variable Length
    Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
    each n or Xaa can only represent a single residue. Please present the maximum number of each
    residue having variable length and indicate in the <220>-<223> section that some may be missing.

PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
    sequence(s) ____. Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
    the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
    Artificial or Unknown sequences.

Skipped Sequences (OLD RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

    (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    This sequence is intentionally skipped

    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

Skipped Sequences (NEW RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

    <210> sequence id number
    <400> sequence id number
    000

Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's has been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
    scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
    is Artificial Sequence.

Use of <220>
    Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
    "Unknown." Please explain source of genetic material in <220> to <223> section.
    (See "Federal Register" 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

PatentIn 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
    resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
    listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.



Misuse of n
    n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent
    any value not specifically a nucleotide.



AMC/MH - Biotechnology Systems Branch - 08/21/2001 
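
The line-level rules in the summary above (the 72-character limit, ASCII-only text, and no tab codes) can be pre-checked mechanically before a file is submitted. The following is a minimal Python sketch under stated assumptions, not the USPTO Checker program: the function name `check_line_format` and the problem labels are illustrative, and flagging every tab is a simplification, since the summary only mentions tab codes in amino acid numbering.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the "Invalid Line Length" entry


def check_line_format(text):
    """Return (line_number, problem) pairs for line-level format problems.

    Hypothetical helper for illustration; labels are not official codes.
    """
    problems = []
    for number, raw in enumerate(text.split("\n"), start=1):
        line = raw.rstrip("\r")  # tolerate DOS line endings
        if len(line) > MAX_LINE_LENGTH:  # white space counts toward the limit
            problems.append((number, "Invalid Line Length"))
        if not line.isascii():
            problems.append((number, "Non-ASCII"))
        if "\t" in line:
            problems.append((number, "Tab character"))
    return problems


if __name__ == "__main__":
    # Made-up sample: line 2 is 73 characters long, line 3 contains a tab.
    sample = "<210> 1\n" + "a" * 73 + "\n1\t5\n"
    for number, problem in check_line_format(sample):
        print(f"line {number}: {problem}")
```

A check like this only covers the format entries; the content rules (valid <213> responses, mandatory <220>-<223> sections) need the record structure of the listing.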
Does Not Comply
Corrected Diskette Needed



PCT10 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/030,678

DATE: 08/05/2002
TIME: 11:23:38

Input Set : A:\56761seq.txt
Output Set: N:\CRF4\08052002\J030678.raw

<110> APPLICANT: SUGIYAMA, Hiroshi 
BANDO, Toshikazu 
IIDA, Hirokazu 
SAITO, Isao 

<120> TITLE OF INVENTION: INTERSTRAND CROSSLINKING AGENTS FOR DNA
AND COMPOUNDS THEREFOR
<130> FILE REFERENCE: 56761 (71526)
<140> CURRENT APPLICATION NUMBER: 10/030,678
<141> CURRENT FILING DATE: 2002-01-11
<150> PRIOR APPLICATION NUMBER: PCT/JP01/03756
<151> PRIOR FILING DATE: 2001-05-01
<160> NUMBER OF SEQ ID NOS: 10
<170> SOFTWARE: FastSEQ for Windows Version 3.0

<210> SEQ ID NO: 1
<211> LENGTH: 5
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 1
cgacg 5
<210> SEQ ID NO: 2
<211> LENGTH: 18
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 2
ttacagtggc tgccagca 18
<210> SEQ ID NO: 3
<211> LENGTH: 18
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 3
ttatgctggc agccactg 18
<210> SEQ ID NO: 4
<211> LENGTH: 14
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 4
ttacagtggc tgcc 14
<210> SEQ ID NO: 5
<211> LENGTH: 17
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 5
ttacagtggc gccagca 17



file://C:\CRF4\Outhold\VsrJ030678.htm



8/5/02 



<210> SEQ ID NO: 6
<211> LENGTH: 18
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 6
ttacagtggc tgccagca 18
<210> SEQ ID NO: 7
<211> LENGTH: 19
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 7
ttacagtggc ttgccagca 19
<210> SEQ ID NO: 8
<211> LENGTH: 20
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 8
ttacagtggc tttgccagca 20
<210> SEQ ID NO: 9
<211> LENGTH: 14
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 9
ttacagtggc tgcc 14
<210> SEQ ID NO: 10
<211> LENGTH: 9
<212> TYPE: DNA
<213> ORGANISM: Nucleic Acid Interstrand-Crosslinking Agents
<400> SEQUENCE: 10
tggctgcca 9
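
Two of the record-level rules from the error summary can be checked against a listing laid out like the one above: the <211> length should equal the number of residues under <400>, and a <213> response of Unknown or Artificial Sequence requires a <220>-<223> section. The following Python sketch is a rough illustration only. It assumes the PTO printout layout with "alpha" headers (for example `<211> LENGTH: 5`), handles nucleotide records only, does not handle skipped-sequence placeholders, and does not try to decide whether a <213> response is a real scientific name. The names `parse_records` and `check_records` and the problem messages are hypothetical.

```python
import re

TAG_LINE = re.compile(r"^<(\d{3})>(.*)$")


def parse_records(text):
    """Split a listing into one dict per <210> record (tag -> value)."""
    records, current, in_sequence = [], None, False
    for raw in text.splitlines():
        line = raw.strip()
        match = TAG_LINE.match(line)
        if match:
            tag, rest = match.groups()
            # Assumes an alpha header such as "LENGTH:" precedes the value.
            value = rest.partition(":")[2].strip()
            if tag == "210":
                current = {"210": value, "residues": ""}
                records.append(current)
            elif current is not None:
                current[tag] = value
            in_sequence = tag == "400" and current is not None
        elif in_sequence and line:
            # Residue line: keep letters only, dropping the running count.
            current["residues"] += re.sub(r"[^A-Za-z]", "", line)
    return records


def check_records(records):
    """Return (seq_id, problem) pairs for the two rules described above."""
    problems = []
    for rec in records:
        seq_id = rec["210"]
        if rec.get("212") in ("DNA", "RNA"):
            declared = rec.get("211", "")
            if not declared.isdigit() or int(declared) != len(rec["residues"]):
                problems.append((seq_id, "<211> does not match residue count"))
        if rec.get("213") in ("Unknown", "Artificial Sequence") and "220" not in rec:
            problems.append((seq_id, "<220>-<223> section missing"))
    return problems
```

Calling `check_records(parse_records(text))` on the text of a listing returns an empty list when neither rule is violated.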






